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1)  Statement  of  problem  studied 

As  part  of  this  project,  we  studied  how  the  evolution  of  thermostability  influences 
protein  dynamics.  Molecular  dynamics  simulations  were  employed  to  study  how  protein 
solution  structure  and  dynamics  are  affected  by  adaptation  to  high  temperature. 

In  2001  we  shifted  the  focus  of  this  project  to  developing  structure-based  computational 
approaches  for  improving  molecular  evolution  methods  of  protein  design.  We  have 
validated  the  SCHEMA  algorithm  for  designing  chimeric  sequences  and  libraries,  from 
which  we  can  identify  useful,  new  proteins. 


2)  Summary  of  accomplishments 

Protein  dynamics  in  a  family  of  laboratory  evolved  thermophilic  esterases  (Wintrode): 

As  part  of  this  project,  we  studied  how  the  evolution  of  thermostability  influences 
protein  dynamics,  in  collaboration  with  the  Goddard  group  in  theoretical  chemistry. 
Molecular  dynamics  simulations  were  employed  to  study  how  protein  solution  structure  and 
dynamics  are  affected  by  adaptation  to  high  temperature.  Simulations  were  carried  out  on  a 
para-nitrobenzyl  esterase  (484  residues)  and  two  thermostable  variants  that  were  generated 
by  laboratory  evolution.  Although  these  variants  display  much  higher  melting  temperatures 
than  wild-type  (up  to  18  °C  higher)  they  are  both  >97%  identical  in  sequence  to  the  wild- 
type.  In  simulations  at  300  K  the  thermostable  variants  remain  closer  to  their  crystal 
structures  than  wild-type.  However,  they  also  display  increased  fluctuations  about  their 
time-averaged  structures.  Additionally,  both  variants  show  a  small  but  significant  increase 
in  radius  of  gyration  relative  to  wild-type.  The  vibrational  density  of  states  was  calculated 
for  each  of  the  esterases.  While  the  density  of  states  profiles  is  similar  overall,  both 
thermostable  mutants  show  increased  populations  of  the  very  lowest  frequency  modes  (<10 
cm-1),  with  the  more  stable  mutant  showing  the  larger  increase.  This  indicates  that  the 
thermally  stable  variants  experience  increased  concerted  motions  relative  to  wild-type. 
Taken  together,  these  data  suggest  that  adaptation  for  high  temperature  stability  has  resulted 
in  a  restriction  of  large  deviations  from  the  native  state  and  a  corresponding  increase  in 
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smaller  scale  fluctuations  about  the  native  state.  These  fluctuations  contribute  to  entropy 
and  hence  to  the  stability  of  the  native  state.  The  largest  changes  in  localized  dynamics 
occur  in  surface  loops,  while  other  regions,  particularly  the  active  site  residues,  remain 
essentially  unchanged.  Several  mutations,  most  notably  L313F  and  H322Y  in  variant  8G8, 
are  in  the  region  showing  the  largest  increase  in  fluctuations,  suggesting  that  these 
mutations  confer  more  flexibility  to  the  loops.  As  a  validation  of  our  simulations,  the 
fluctuations  of  Trpl02  were  examined  in  detail,  and  compared  with  Trpl02 
phosphorescence  lifetimes  that  were  previously  measured.  Consistent  with  expectations 
from  the  theory  of  phosphorescence,  an  inverse  correlation  between  out-of-plane 
fluctuations  on  the  picosecond  time  scale  and  phosphorescence  lifetime  was  observed. 


Computational  approaches  to  improve  molecular  evolution  searches; 

The  computational  algorithm  SCHEMA  was  developed  to  estimate  the  disruption 
caused  when  amino  acid  residues  that  interact  in  the  three-dimensional  structure  of  a 
protein  are  inherited  from  different  parents  upon  recombination  (Voigt  et  al.,  Nature 
Structural  Biology,  9,  553-558  (2002).  We  then  showed  that  the  simple  distance  metric 
used  by  SCHEMA  to  identify  interactions  and  compute  crossover  disruption  allows  one  to 
predict  which  chimera  sequences  are  most  likely  to  retain  their  function  (Meyer  et  al., 
Protein  Science,  12  (8)  (2003).  This  approach  can  be  used  to  evaluate  crossover  sites  for 
recombination  and  to  create  highly  mosaic,  folded  chimeras. 

With  ARO  support  we  have  explored  recombination  in  a  key  enzyme  family,  the 
cytochrome  P450s  (CYP).  Recombination  generates  chimeric  proteins  whose  ability  to  fold 
and  function  depends  on  minimizing  structural  perturbations  that  result  when  portions  of 
the  sequence  are  inherited  from  different  parents.  If  the  folded  structure  is  retained,  the 
new  chimeric  sequences  can  display  functional  properties  characteristic  of  the  parents;  they 
can  also  acquire  entirely  new  functions.  SCHEMA  was  used  to  design  chimeras  of  two 
members  of  the  CYP102  subfamily  of  the  functionally  diverse  cytochrome  P450  heme 
enzymes.  Seventeen  double-crossover  chimeras  were  constructed  from  the  heme  domains 
of  CYP  1 02 A 1  and  CYP102A2  (63%  amino  acid  identity).  Chimeras  predicted  to  have 
limited  structural  disruption  displayed  CO-binding  spectra  characteristic  of  folded  P450s, 
and  many  were  catalytically  active.  Even  this  small  population  of  chimeras  exhibited 
significant  functional  diversity;  chimeras  displayed  altered  substrate  specificity  profiles,  a 
wide  range  in  thermostabilities,  up  to  a  40fold  increase  in  peroxidase  activity,  and  ability  to 
hydroxylate  a  substrate  towards  which  neither  parent  shows  detectable  activity.  These 
results  suggest  that  SCHEMA-guided  recombination  can  generate  diverse  P450s  for 
exploring  the  evolution  of  function  within  the  P450  structural  framework. 

We  are  now  creating  large  libraries  of  diverse  P450s  by  structure-guided  recombination. 
We  expect  that  a  significant  fraction  will  fold  properly  and  that  many  will  exhibit  novel, 
and  potentially  useful,  function.  Our  goal  is  to  assess  the  degree  of  functional  evolution 
that  is  possible  using  this  evolutionary  search  strategy.  We  believe  that  structure-guided 
recombination  will  allow  us  to  access  useful  catalysts  rapidly  and  create  libraries  that 
encompass  many  of  the  known  natural  activities,  as  well  as  useful  new  activities. 
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